Multi-armed bandit

Results: 113



#Item
91 Game theory / Cybernetics / Machine learning / Search algorithms / Learning / Reinforcement learning / Markov decision process / Multi-armed bandit / Algorithm / Statistics / Mathematics / Applied mathematics

Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA [removed] USA Leslie Pack Kaelbling

Source URL: people.csail.mit.edu

Language: English - Date: 2005-11-02 21:38:45
92 Game theory / Cybernetics / Machine learning / Search algorithms / Learning / Reinforcement learning / Markov decision process / Multi-armed bandit / Algorithm / Statistics / Mathematics / Applied mathematics

Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA [removed] USA Leslie Pack Kaelbling

Source URL: www.machinelearning.org

Language: English - Date: 2008-12-01 11:15:56
93 Stochastic optimization / Search algorithms / Unimodality / Real analysis / Convex analysis / Multi-armed bandit / Normal distribution / Markov decision process / Mode / Statistics / Mathematical analysis / Mathematics

Unimodal Bandits [removed] Jia Yuan Yu Ecole Normale Supérieure, HEC Paris, CNRS, France. Shie Mannor

Source URL: www.icml-2011.org

Language: English - Date: 2011-06-01 14:49:36
94 Gittins index / Multi-armed bandit / Underwater acoustic communication / Normal distribution / Modem / Variance / Statistics / Decision theory / Design of experiments

Multi-armed Bandit Formulation for Autonomous Mobile Acoustic Relay Adaptive Positioning Mei Yi Cheung, Joshua Leighton, Franz S. Hover Abstract— We apply the stationary multi-armed bandit (MAB) formalism to the proble

Source URL: web.mit.edu

Language: English - Date: 2013-07-15 22:25:46
95 Markov processes / Stochastic control / Information systems / Reinforcement learning / Markov decision process / Multi-armed bandit / Q-learning / Preference elicitation / Machine learning / Statistics / Dynamic programming / Artificial intelligence

Social User Agents for Dynamic Access to Wireless Networks P. Faratin and G. Lee and J. Wroclawski S. Parsons Laboratory for Computer Science

Source URL: groups.csail.mit.edu

Language: English - Date: 2010-02-11 14:14:32
96 Game theory / Artificial intelligence / Search algorithms / Control theory / Game artificial intelligence / Minimax / Regret / Observability / Multi-armed bandit / Statistics / Decision theory / Mathematics

An adaptive algorithm for finite stochastic partial monitoring Gábor Bartók [removed]

Source URL: icml.cc

Language: English - Date: 2012-06-07 13:20:54
97 Multi-armed bandit / Expected value / Decision theory / Search theory / Secretary problem / Statistics / Stochastic optimization / Machine learning

Journal of Economic Theory 101, 252-280 [removed] doi:[removed]jeth[removed], available online at http://www.idealibrary.com on Learning While Searching for the Best Alternative Klaus Adam European University Institute, Via

Source URL: adam.vwl.uni-mannheim.de

Language: English - Date: 2008-09-23 18:11:18
98 Machine learning / Multi-armed bandit / Stochastic optimization / Decision theory / Gittins index / Reinforcement learning / Bandit / Kullback–Leibler divergence / Probability distribution / Statistics / Design of experiments / Statistical theory

A modern Bayesian look at the multiarmed bandit

Source URL: www.economics.uci.edu

Language: English - Date: 2011-03-31 14:42:20
99 Mathematical optimization / Machine learning / Multi-armed bandit / Cybernetics / Reinforcement learning / Algorithm / Greedy algorithm / Dynamic treatment regime / Statistics / Mathematics / Stochastic optimization

Journal of Machine Learning Research [removed] Submitted 4/00; Published [removed] Algorithms for the multi-armed bandit problem Volodymyr Kuleshov

Source URL: www.cs.mcgill.ca

Language: English - Date: 2010-12-24 03:47:38
100 Stochastic optimization / Mathematical analysis / Machine learning / Multi-armed bandit / Algorithm / Polylogarithm / Regret / Statistics / Mathematics / Decision theory

Reducing Dueling Bandits to Cardinal Bandits Nir Ailon Technion, Dept. of Computer Science, Haifa 32000, Israel NAILON@TECHNION.AC.IL

Source URL: jmlr.org

Language: English - Date: 2014-06-18 10:58:08